AITopics | esg report

Collaborating Authors

esg report

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

EulerESG: Automating ESG Disclosure Analysis with LLMs

Ding, Yi, Tang, Xushuo, Yang, Zhengyi, Zhang, Wenqian, Wu, Simin, Huang, Yuxin, Lan, Lingjing, Li, Weiyuan, Chen, Yin, Ju, Mingchen, Yang, Wenke, Hoang, Thong, Klymenko, Mykhailo, Zu, Xiwei, Zhang, Wenjie

arXiv.org Artificial IntelligenceDec-1-2025

Environmental, Social, and Governance (ESG) reports have become central to how companies communicate climate risk, social impact, and governance practices, yet they are still published primarily as long, heterogeneous PDF documents. This makes it difficult to systematically answer seemingly simple questions. Existing tools either rely on brittle rule-based extraction or treat ESG reports as generic text, without explicitly modelling the underlying reporting standards. We present \textbf{EulerESG}, an LLM-powered system for automating ESG disclosure analysis with explicit awareness of ESG frameworks. EulerESG combines (i) dual-channel retrieval and LLM-driven disclosure analysis over ESG reports, and (ii) an interactive dashboard and chatbot for exploration, benchmarking, and explanation. Using four globally recognised companies and twelve SASB sub-industries, we show that EulerESG can automatically populate standard-aligned metric tables with high fidelity (up to 0.95 average accuracy) while remaining practical in end-to-end runtime, and we compare several recent LLM models in this setting. The full implementation, together with a demonstration video, is publicly available at https://github.com/UNSW-database/EulerESG.

euleresg, large language model, machine learning, (14 more...)

arXiv.org Artificial Intelligence

2511.21712

Country: North America > United States (0.14)

Genre: Research Report (0.50)

Industry: Banking & Finance (0.94)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.71)

Add feedback

Pharos-ESG: A Framework for Multimodal Parsing, Contextual Narration, and Hierarchical Labeling of ESG Report

Chen, Yan, Zou, Yu, Zeng, Jialei, You, Haoran, Zhou, Xiaorui, Zhong, Aixi

arXiv.org Artificial IntelligenceNov-21-2025

Environmental, Social, and Governance (ESG) principles are reshaping the foundations of global financial gover- nance, transforming capital allocation architectures, regu- latory frameworks, and systemic risk coordination mecha- nisms. However, as the core medium for assessing corpo- rate ESG performance, the ESG reports present significant challenges for large-scale understanding, due to chaotic read- ing order from slide-like irregular layouts and implicit hier- archies arising from lengthy, weakly structured content. To address these challenges, we propose Pharos-ESG, a uni- fied framework that transforms ESG reports into structured representations through multimodal parsing, contextual nar- ration, and hierarchical labeling. It integrates a reading-order modeling module based on layout flow, hierarchy-aware seg- mentation guided by table-of-contents anchors, and a multi- modal aggregation pipeline that contextually transforms vi- sual elements into coherent natural language. The framework further enriches its outputs with ESG, GRI, and sentiment labels, yielding annotations aligned with the analytical de- mands of financial research. Extensive experiments on anno- tated benchmarks demonstrate that Pharos-ESG consistently outperforms both dedicated document parsing systems and general-purpose multimodal models. In addition, we release Aurora-ESG, the first large-scale public dataset of ESG re- ports, spanning Mainland China, Hong Kong, and U.S. mar- kets, featuring unified structured representations of multi- modal content, enriched with fine-grained layout and seman- tic annotations to better support ESG integration in financial governance and decision-making.

large language model, machine learning, natural language, (20 more...)

arXiv.org Artificial Intelligence

2511.16417

Country: Asia > China > Hong Kong (0.25)

Genre: Research Report (1.00)

Industry: Banking & Finance (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.95)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.69)

Add feedback

SusGen-GPT: A Data-Centric LLM for Financial NLP and Sustainability Report Generation

Wu, Qilong, Xiang, Xiaoneng, Huang, Hejia, Wang, Xuan, Jie, Yeo Wei, Satapathy, Ranjan, Filho, Ricardo Shirota, Veeravalli, Bharadwaj

arXiv.org Artificial IntelligenceDec-14-2024

The rapid growth of the financial sector and the rising focus on Environmental, Social, and Governance (ESG) considerations highlight the need for advanced NLP tools. However, open-source LLMs proficient in both finance and ESG domains remain scarce. To address this gap, we introduce SusGen-30K, a category-balanced dataset comprising seven financial NLP tasks and ESG report generation, and propose TCFD-Bench, a benchmark for evaluating sustainability report generation. Leveraging this dataset, we developed SusGen-GPT, a suite of models achieving state-of-the-art performance across six adapted and two off-the-shelf tasks, trailing GPT-4 by only 2% despite using 7-8B parameters compared to GPT-4's 1,700B. Based on this, we propose the SusGen system, integrated with Retrieval-Augmented Generation (RAG), to assist in sustainability report generation. This work demonstrates the efficiency of our approach, advancing research in finance and ESG.

large language model, machine learning, natural language, (20 more...)

arXiv.org Artificial Intelligence

2412.10906

Country:

North America > United States > Louisiana (0.04)
North America > United States > Mississippi (0.04)
North America > United States > Arkansas (0.04)
(5 more...)

Genre:

Research Report (0.82)
Public Relations > Community Relations (0.81)

Industry:

Law (1.00)
Information Technology > Security & Privacy (1.00)
Banking & Finance > Trading (1.00)
(4 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

ML-Promise: A Multilingual Dataset for Corporate Promise Verification

Seki, Yohei, Shu, Hakusen, Lhuissier, Anaïs, Lee, Hanwool, Kang, Juyeon, Day, Min-Yuh, Chen, Chung-Chi

arXiv.org Artificial IntelligenceNov-7-2024

Promises made by politicians, corporate leaders, and public figures have a significant impact on public perception, trust, and institutional reputation. However, the complexity and volume of such commitments, coupled with difficulties in verifying their fulfillment, necessitate innovative methods for assessing their credibility. This paper introduces the concept of Promise Verification, a systematic approach involving steps such as promise identification, evidence assessment, and the evaluation of timing for verification. We propose the first multilingual dataset, ML-Promise, which includes English, French, Chinese, Japanese, and Korean, aimed at facilitating in-depth verification of promises, particularly in the context of Environmental, Social, and Governance (ESG) reports. Given the growing emphasis on corporate environmental contributions, this dataset addresses the challenge of evaluating corporate promises, especially in light of practices like greenwashing. Our findings also explore textual and image-based baselines, with promising results from retrieval-augmented generation (RAG) approaches. This work aims to foster further discourse on the accountability of public commitments across multiple languages and domains.

dataset, extraction 0, rag, (14 more...)

arXiv.org Artificial Intelligence

2411.04473

Country:

North America > United States (0.15)
Europe > France (0.05)
Asia > Japan > Honshū > Kantō > Ibaraki Prefecture > Tsukuba (0.05)
(4 more...)

Genre: Research Report > New Finding (1.00)

Industry: Social Sector (0.72)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.71)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.36)

Add feedback

Statements: Universal Information Extraction from Tables with Large Language Models for ESG KPIs

Mishra, Lokesh, Dhibi, Sohayl, Kim, Yusik, Ramis, Cesar Berrospi, Gupta, Shubham, Dolfi, Michele, Staar, Peter

arXiv.org Artificial IntelligenceJun-27-2024

Environment, Social, and Governance (ESG) KPIs assess an organization's performance on issues such as climate change, greenhouse gas emissions, water consumption, waste management, human rights, diversity, and policies. ESG reports convey this valuable quantitative information through tables. Unfortunately, extracting this information is difficult due to high variability in the table structure as well as content. We propose Statements, a novel domain agnostic data structure for extracting quantitative facts and related information. We propose translating tables to statements as a new supervised deep-learning universal information extraction task. We introduce SemTabNet - a dataset of over 100K annotated tables. Investigating a family of T5-based Statement Extraction Models, our best model generates statements which are 82% similar to the ground-truth (compared to baseline of 21%). We demonstrate the advantages of statements by applying our model to over 2700 tables from ESG reports. The homogeneous nature of statements permits exploratory data analysis on expansive information found in large collections of ESG reports.

large language model, machine learning, node, (18 more...)

arXiv.org Artificial Intelligence

2406.19102

Country: Europe (1.00)

Genre: Research Report (0.81)

Industry:

Water & Waste Management > Solid Waste Management (0.67)
Energy > Oil & Gas (0.46)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Extraction (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Advanced Unstructured Data Processing for ESG Reports: A Methodology for Structured Transformation and Enhanced Analysis

Peng, Jiahui, Gao, Jing, Tong, Xin, Guo, Jing, Yang, Hang, Qi, Jianchuan, Li, Ruiqiao, Li, Nan, Xu, Ming

arXiv.org Artificial IntelligenceJan-4-2024

In the evolving field of corporate sustainability, analyzing unstructured Environmental, Social, and Governance (ESG) reports is a complex challenge due to their varied formats and intricate content. This study introduces an innovative methodology utilizing the "Unstructured Core Library", specifically tailored to address these challenges by transforming ESG reports into structured, analyzable formats. Our approach significantly advances the existing research by offering high-precision text cleaning, adept identification and extraction of text from images, and standardization of tables within these reports. Emphasizing its capability to handle diverse data types, including text, images, and tables, the method adeptly manages the nuances of differing page layouts and report styles across industries. This research marks a substantial contribution to the fields of industrial ecology and corporate sustainability assessment, paving the way for the application of advanced NLP technologies and large language models in the analysis of corporate governance and sustainability. Our code is available at https://github.com/linancn/TianGong-AI-Unstructure.git.

application, esg report, unstructured data, (10 more...)

arXiv.org Artificial Intelligence

2401.02992

Country:

North America > United States (0.28)
Asia > Japan (0.04)
Asia > China (0.04)

Genre: Research Report (0.82)

Industry:

Information Technology > Software (0.41)
Law (0.34)
Retail (0.31)

Technology:

Information Technology > Information Management (1.00)
Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.95)
(2 more...)

Add feedback

ESGReveal: An LLM-based approach for extracting structured data from ESG reports

Zou, Yi, Shi, Mengying, Chen, Zhongjie, Deng, Zhu, Lei, ZongXiong, Zeng, Zihan, Yang, Shiming, Tong, HongXiang, Xiao, Lei, Zhou, Wenwen

arXiv.org Artificial IntelligenceDec-25-2023

ESGReveal is an innovative method proposed for efficiently extracting and analyzing Environmental, Social, and Governance (ESG) data from corporate reports, catering to the critical need for reliable ESG information retrieval. This approach utilizes Large Language Models (LLM) enhanced with Retrieval Augmented Generation (RAG) techniques. The ESGReveal system includes an ESG metadata module for targeted queries, a preprocessing module for assembling databases, and an LLM agent for data extraction. Its efficacy was appraised using ESG reports from 166 companies across various sectors listed on the Hong Kong Stock Exchange in 2022, ensuring comprehensive industry and market capitalization representation. Utilizing ESGReveal unearthed significant insights into ESG reporting with GPT-4, demonstrating an accuracy of 76.9% in data extraction and 83.7% in disclosure analysis, which is an improvement over baseline models. This highlights the framework's capacity to refine ESG data analysis precision. Moreover, it revealed a demand for reinforced ESG disclosures, with environmental and social data disclosures standing at 69.5% and 57.2%, respectively, suggesting a pursuit for more corporate transparency. While current iterations of ESGReveal do not process pictorial information, a functionality intended for future enhancement, the study calls for continued research to further develop and compare the analytical capabilities of various LLMs. In summary, ESGReveal is a stride forward in ESG data processing, offering stakeholders a sophisticated tool to better evaluate and advance corporate sustainability efforts. Its evolution is promising in promoting transparency in corporate reporting and aligning with broader sustainable development aims.

esgreveal, large language model, machine learning, (19 more...)

arXiv.org Artificial Intelligence

2312.17264

Country: Asia > China > Hong Kong (0.26)

Genre: Research Report > New Finding (1.00)

Industry:

Energy > Oil & Gas (1.00)
Banking & Finance > Trading (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

ESG Accountability Made Easy: DocQA at Your Service

Mishra, Lokesh, Berrospi, Cesar, Dinkla, Kasper, Antognini, Diego, Fusco, Francesco, Bothur, Benedikt, Lysak, Maksym, Livathinos, Nikolaos, Nassar, Ahmed, Vagenas, Panagiotis, Morin, Lucas, Auer, Christoph, Dolfi, Michele, Staar, Peter

arXiv.org Artificial IntelligenceNov-30-2023

We present Deep Search DocQA. This application enables information extraction from documents via a question-answering conversational assistant. The system integrates several technologies from different AI disciplines consisting of document conversion to machine-readable format (via computer vision), finding relevant data (via natural language processing), and formulating an eloquent response (via large language models). Users can explore over 10,000 Environmental, Social, and Governance (ESG) disclosure reports from over 2000 corporations. The Deep Search platform can be accessed at: https://ds4sd.github.io.

esg report, information, international conference, (15 more...)

arXiv.org Artificial Intelligence

2311.18481

Country:

Europe > Switzerland > Zürich > Zürich (0.16)
North America > United States (0.15)
Europe > Slovenia > Drava > Municipality of Benedikt > Benedikt (0.05)
(4 more...)

Genre: Research Report (0.40)

Industry: Banking & Finance (0.69)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.92)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.68)

Add feedback

A Scalable Framework for Table of Contents Extraction from Complex ESG Annual Reports

Wang, Xinyu, Gui, Lin, He, Yulan

arXiv.org Artificial IntelligenceOct-27-2023

Table of contents (ToC) extraction centres on structuring documents in a hierarchical manner. In this paper, we propose a new dataset, ESGDoc, comprising 1,093 ESG annual reports from 563 companies spanning from 2001 to 2022. These reports pose significant challenges due to their diverse structures and extensive length. To address these challenges, we propose a new framework for Toc extraction, consisting of three steps: (1) Constructing an initial tree of text blocks based on reading order and font sizes; (2) Modelling each tree node (or text block) independently by considering its contextual information captured in node-centric subtree; (3) Modifying the original tree by taking appropriate action on each tree node (Keep, Delete, or Move). This construction-modelling-modification (CMM) process offers several benefits. It eliminates the need for pairwise modelling of section headings as in previous approaches, making document segmentation practically feasible. By incorporating structured information, each section heading can leverage both local and long-distance context relevant to itself. Experimental results show that our approach outperforms the previous state-of-the-art baseline with a fraction of running time. Our framework proves its scalability by effectively handling documents of any length.

esgdoc, information, node, (15 more...)

arXiv.org Artificial Intelligence

2310.18073

Country:

North America > United States > New York > New York County > New York City (0.04)
North America > Canada > Ontario > Toronto (0.04)
Europe > Spain > Catalonia > Barcelona Province > Barcelona (0.04)
(5 more...)

Genre:

Collection (0.61)
Research Report > New Finding (0.34)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.46)

Add feedback

ESG Report: Machine Learning Delivers Better IT Insights

#artificialintelligenceJul-28-2018, 01:07:15 GMT

Download the research report from ESG, and learn how predictive analytics and machine learning can help optimize application performance and meet the needs of the business with Nimble Storage Infosight. The performance of enterprise applications will have a direct impact on business activities and outcomes. The quality of the delivery of applications will depend on how smoothly the underlying data infrastructure operates.

artificial intelligence, esg report, machine learning, (1 more...)

#artificialintelligence

Genre: Research Report (0.84)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.75)

Add feedback